An analysis of the causes of increased error rates in children's speech recognition
Authors
Abstract
Previous studies have shown that children’s speech is more difficult for machines to recognise than adults’ speech. This paper presents the results of experiments which investigate recognition performance variation within a small population of children. Results suggest that recogniser performance on a child’s speech is well correlated with a teacher’s assessment of the child’s speaking proficiency. For children whose speech is judged to be good, performance is close to that of adults, but error rates increase by a factor of 4 for children with ‘poor’ speech. An analysis of actual pronunciations for children with poor speech shows significant divergence from the ‘idealised’ baseforms in a pronunciation dictionary. It is demonstrated that some improvements can be gained through the use of customised dictionaries. Finally, the effects of bandwidth reduction on recogniser performance are investigated for a range of children with differing speaking styles.
Similar resources
Correlation between Auditory Spectral Resolution and Speech Perception in Children with Cochlear Implants
Background: Variability in speech performance is a major concern for children with cochlear implants (CIs). Spectral resolution is an important acoustic component in speech perception. Considerable variability and limitations of spectral resolution in children with CIs may lead to individual differences in speech performance. The aim of this study was to assess the correlation between auditory ...
Picture Naming in Children With and Without Specific Language Impairment: An Error Analysis Study
Objectives: Lexical-level deficits are one of the hallmark limitations observed in early school-age children with specific language impairment. These problems predict reading problems and poor performance at school. Most studies of Persian-speaking children with specific language impairment have focused on syntax and morphology and the naming errors profile of these children ar...
Auditory Brainstem Response Wave Amplitude Characteristics as a Diagnostic Tool in Children with Speech Delay with Unknown Causes
Speech delay with an unknown cause is a problem among children. This diagnosis is the last differential diagnosis after observing normal findings in routine hearing tests. The present study was undertaken to determine whether auditory brainstem responses to click stimuli are different between normally developing children and children suffering from delayed speech with unknown causes. In this cr...
Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions
Automatic recognition of speech emotional states in noisy conditions has become an important research topic in the emotional speech recognition area, in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...
Statistical Variation Analysis of Formant and Pitch Frequencies in Anger and Happiness Emotional Sentences in Farsi Language
Setting up an emotion recognition or emotional speech recognition system depends directly on how emotion changes the speech features. In this research, the influence of anger and happiness on speech was evaluated and the results were compared with neutral speech, using the pitch frequency and the first three formant frequencies. The experimental results showed that there are lo...